Use audio feature in ASR task template #4006

lhoestq · 2022-03-24T11:15:22Z

The AutomaticSpeechRecognition task template is outdated: it still uses the file path column as input instead of the audio column.

I changed that and updated all the datasets as well as the tests.

The only community dataset that will need to be updated is facebook/multilingual_librispeech. It has almost zero usage unfortunately (probably because users load the duplicate multilingual_librispeech directly instead), but it means we can update it.

(this makes me think that we should deprecate multilingual_librispeech it and redirect users to facebook/multilingual_librispeech).

This PR is also useful for the AudioFolder in #3963

HuggingFaceDocBuilderDev · 2022-03-24T11:24:01Z

The documentation is not available anymore as the PR was closed or merged.

patrickvonplaten

Thanks a lot for fixing all those!

lhoestq added 4 commits March 24, 2022 12:02

use audio feature in ASR task template

4d558c3

update datasets

56796dd

update tests

8c03707

typo

ed39389

lhoestq requested review from polinaeterna and patrickvonplaten March 24, 2022 11:15

mariosasko mentioned this pull request Mar 24, 2022

Use the Audio feature in the AutomaticSpeechRecognition template #3364

Closed

update dataset_infos.json

27fa5ef

patrickvonplaten approved these changes Mar 24, 2022

View reviewed changes

lhoestq merged commit 32e0e79 into master Mar 24, 2022

lhoestq deleted the use-audio-feature-in-ASR-task-template branch March 24, 2022 16:48

lhoestq mentioned this pull request Mar 30, 2022

Deprecate canonical Multilingual Librispeech #4060

Merged

mariosasko mentioned this pull request Jun 1, 2022

Use Audio features for AutomaticSpeechRecognition task template #2536

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use audio feature in ASR task template #4006

Use audio feature in ASR task template #4006

lhoestq commented Mar 24, 2022

HuggingFaceDocBuilderDev commented Mar 24, 2022 •

edited

Loading

patrickvonplaten left a comment

Use audio feature in ASR task template #4006

Use audio feature in ASR task template #4006

Conversation

lhoestq commented Mar 24, 2022

HuggingFaceDocBuilderDev commented Mar 24, 2022 • edited Loading

patrickvonplaten left a comment

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Mar 24, 2022 •

edited

Loading